POINTWISE CONVERGENCE OF THE ERGODIC BILINEAR 

HILBERT TRANSFORM 



CIPRIAN DEMETER 

Abstract. Let X — (X, E,to, r) be a dynamical system. We prove that the bihnear 

series Y^'n=-N^^^~^^^^ir' — ~ converges almost everywhere for each f,g G L°°{X). We 
also give a proof along the same lines of Bourgain's analog result for averages. 



1. Introduction 

Let X = (X, S, m, r) be a dynamical system, i.e. a complete probability space (X, S, m) 
endowed with an invertible bimeasurable transformation r : X — >■ X such that mr^^ = m. 
The starting point in this discussion is a result proved by Bourgain for bilinear averages. 



Theorem 1.1 ([3]). For each f,g E L°°(X) the averages 

N-l 

N 



N-l 



ra=0 



converge for almost every x. 



Bourgain's method consists of turning the issue of almost everywhere convergence into a 
quantitative problem regarding multipliers on the torus, which are investigated by using 
classical Fourier analysis. An important reduction of ergodic theoretic nature in his 
argument concerns the fact that g can be assumed to be orthogonal to the linear space 
L^(K), where K = (X, /C, m) and /C C S is the a- algebra generated by the eigenfunctions 
of r. This is because the convergence is trivial in the case g is an eigenfunction, as 
it is easily seen from Birkhoff's ergodic theorem [2]. This reduction has the following 
consequence for the spectral behavior of g 

N-l 

hm sup|-5^(?(r-"a:)|=0, (2) 

see [1] for a proof of this and of some related results. Using this, Bourgain identifies the 
limit to be for such a g. More generally 

^ N-l ^ N-l 

lim - y /(r"x)(?(r-"x) = lim - V f{r''x)Pj^g{T-''x), 

n=0 n=0 

for each g G L°°{X), where Pj^g is the projection of g onto L^(K). A different way of 
putting this is to say that the Kronecker factor K is a characteristic factor for the almost 
everywhere convergence of the averages (1). 

In this paper we will prove the convergence of the ergodic bilinear Hilbert transform. 
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Theorem 1.2. For each f,g E L°°{X) the series 



N 



E 



n 

n=-N 



converges for almost every x. 

Remark 1.3. As a consequence of the above and of the bihnear maximal inequahty for 
very general kernels in [7], it follows that Theorem 1.2 holds for all / e U'{X)^ g e L^(X), 
whenever 1 < p, o < oo and - + -<§. 

Bourgain's approach does not seem to be applicable to the context of series, in part 
due to the fact that the characteristic factors for weighted operators other than the usual 
averages are much less understood, and probably of less relevance to the essence of the 
problem. In particular, (2) fails for the series above, and the Kronecker factor seems to 
be of no immediate relevance to the problem. 

We prove Theorem 1.2 using time-frequency harmonic analysis, and by a similar ar- 
gument we also give a new proof of Theorem 1.1. Our methods will not perceive the 
difference between the differentiation and the singular integral versions of the above, due 
to a common decomposition of both operators into discrete model sums. 

Interestingly, our argument does not appeal to characteristic factors or in general to 
any concrete spectral analysis. Moreover, only little ergodic theory is needed in the 
whole argument, when integration along individual orbits allows us to transfer certain 
oscillation inequalities from harmonic analysis. However, the structure of the Kronecker 
factor is deeply rooted into our approach. Since the (hnear) exponentials e'^^ are the 
eigenfunctions for rotations on the torus, it is probably the case that their presence in 
the wave packet decomposition of g is reminiscent of the expansion of Pj^g into a basis 
consisting of eigenfunctions for r. This also suggests that, perhaps, a time-frequency 
approach to the similar open questions concerning trilinear averages will involve quadratic 
exponentials like e*^^ , which are second order eigenfunctions for the rotations on the torus. 

Both theorems above will be consequences of the following very general harmonic anal- 
ysis result, as explained in Section 3. We will use the notation 



Modeg{x)^e''^''^g{x). 
Theorem 1.4. Let K : Ti ^ Ti be an kernel satisfying the requirements: 

i?eC~(R\{0}) (3) 

1^(01 <niin{l,l}, Ct^O (4) 



(5) 
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Then for each d = 2^/", n eN, each f,g e 
sequence of integers Ui < U2 < ■ ■ ■ < uj 



L°°(R) with bounded support and each finite 




j-i 



,00 



< j'^'\\f\\L49\\L^ 



with the implicit constants depending only on n. 

Remark 1.5. Due to the assumptions that / and g are bounded and have bounded support 
it follows that for each x e R the integral 



converges absolutely. The same remark applies to the following two theorems. 

Remark 1.6. The proof of this theorem is inspired by ideas from [4], [5] and [6]. The 
same techniques can extend the theorem to a larger range for p and q and eliminate the 
dependence on J of the bound in the above inequality. However, its current form suffices 
for our purposes. Moreover, the argument for a more general result as mentioned would 
have to be more technical and would require us to revisit most of the main results in 
[4], making the whole presentation much longer. One of the advantages of the current 
approach is that it does not rely on any type of interpolation, being a purely argument. 

Our hope is that by keeping technicalities to a minimum, the whole argument will 
become easier to follow. The interested reader is certainly referred to [4] for details on 
some of the results we are quoting here. 

As an immediate corollary of Theorem 1.4 we get a particular case of Lacey's inequahty 
for the bilinear maximal function 

Corollary 1.7 ([7]). The following inequality holds for each f,gE -£'^(R) 



Theorem 1.4 will follow from two distinct results of dyadic analysis. The first one. 
Theorem 1.8, is the particular case d = 2 oi the above and captures the main difficulty 
of the problem. The second one. Theorem 1.9, is a square function estimate and will be 
used to control error terms. 

To understand better the secrtransferconnection between these three results we intro- 
duce some notation. Let x : (0, 00) C. Let also ui < . . . < uj be as in Theorem 1.4 
and define Oi < . . . < oj such that UjU < Uj < (a^ + l)n. Then observe that 

J-l , n-l J-1 



f{x + y)g{x - y){ml,K{y) - mlu^^,K{y))dy 




Wfhhh. 




k 



2N1/2 



n-l 




i¥=3 
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Using this inequality and a dilation argument, Theorem 1.4 will follow immediately 
from the following two results. 

Theorem 1.8. Let X : R — > R &e an kernel satisfying (3), (4) and (5). Then for 
each finite sequence of integers ui < U2 < ■ ■ ■ < uj and each f,g& I/°°(R) with bounded 
support 



j-i 

3 — 1 Uj ^k<Uj_^i 



sup \ [ fix + y)g{x - y){Dill,K{y) - Dill.,^,K{y))dy\^f'^\\L^„ 



<J'"\\f\\LA\9\W, 

with some universal implicit constant. 

Theorem 1.9. Let K : H ^ H be an L"^ kernel satisfying (3), (4) and (5) and the extra 
requirement 

\Km<\^\, Ct^O. (6) 
Then the following inequality holds for each f,gE L°°(R) with bounded support 

fcez ^ 

with some universal implicit constant. 

In Section 3 we indicate how the result of Theorem 1.4 can be transfered to a similar 
inequality in a dynamical system, and how this implies the convergence in theorems 1.1 
and 1.2. In Section 4 we discretize the operator in Theorem 1.8 while the remaining 
sections are concerned with proving its boundedness. In the last section we briefly sketch 
how the same procedure can be applied to prove Theorem 1.9. 

2. Notation 

In this section we set up some notation and terminology for the rest of the paper. 

If I is an interval then c(/) denotes the center of /, while cl is the interval with the same 
center and length c times the length of /. By 1a we denote the characteristic function of 
the set ^4 C R, while for any dyadic interval /, 



A tile P is a rectangle P = IpX oop such that Ip is a dyadic interval and |/p| ■ |c<jp| = 1. A 
multitile s is a box s ~ IgXoJs,! x ■ • ■X'^s,n such that Ig is a dyadic interval, \oJs,i\ = ■ ■ ■ \'^s,n\ 
and \Is\ ■ \^s,i\ = 1- 

For each E of finite measure, X{E) will denote the set of all functions supported in E 
with ||/||oo < 1, while X2{E) will denote the normalized set of all functions supported 
in E with ||/||oo < \E\^'^/'^. Also Mf{x) — sup^>o ^ IxX \f\(y)^y denotes the classical 
Hardy-Littlewood maximal function. 

The notation a < 6 means that a < cb for some universal constant c, while a ^ b means 
that a < b and b < a. Sometimes we will write a ^parameters ^ indicate that a < cb 
with c depending only on the specified parameters. 
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3. POINTWISE CONVERGENCE FOR AVERAGES AND SERIES 

3.1. Bounded oscillation implies convergence. Assume we have a sequence Wk of 
weighted operators defined on a dynamical system X = {X, E, m, r) by the formula 



Wk{f,9){x) = Y,WKnf{T^x)g{T-''x), A; e N. 
Lemma 3.1. If for some f,g E L'^{X) 



||(^ sup |W^,(/,^)(x)-iy„,,,(/,^)(x)r)V2|U.oc< jV^||/|U.|b|U. (7) 
uniformly in J and all finite sequences of positive integers ui < U2 < ■ ■ ■ < uj, then 



\imWk{f,g){x) 

K—>00 



exists for almost every x & X. 



Proof To see this, assume for contradiction that the convergence does not hold. It follows 
that there is a measurable set X' C X with m{X') > and some a > 0, such that for 
each X E X' 

limsnpWk{f,g){x) - liminf iyfe(/, > a. 

fc— >oo fe— >oo 

An elementary measure theoretic argument shows that one can then choose a subset 
X" C X' of positive measure and a sequence of positive integers {uj)j^Tsi such that 

sup \Wk{f,g){x) - Wu^^,{f,g){x)\ > a, 

Uj<k<Uj+i 

for each j G N and for each x e X". We immediately get that for each J 

.J X 1/2 

5^ sup \W,{f,g){x) - W^^^,{f,g){x)\' \\,,^ > J^'^am{X"), 

^j^l Uj<k<uj+i J 

which contradicts inequality (7). ■ 

3.2. Proof of Theorem 1.1. Let M e N be arbitrary. We apply Theorem 1.4 to a C°° 
kernel Km satisfying 

Ifo.i] < Km < 1[-_L 

Then we invoke standard transfer methods like in [4] to get the following corollary. 

Corollary 3.2. For each d = 2^/", n G each finite sequence of positive integers 
ui < U2 < ■ ■ ■ < uj and each f,g& -^^(^) 

11(5^ sup \{A,.{f,g) + E,.^M{f,9)) - {Ar.+i{f,g) + E,^,^^^M{f,9))\Y\,oo 



. kez 

3 ^ Uj<k<, Uj _|_ \ 



<M,n J'^'WfMgh^, 
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where for each real r > 1 we denote 



0<n<r 

while 

Er,M{f:g){x) = ^Wr,n,Mf{T''x)g{T~"'x) 
neZ 

is some error term with 



Y,\^r,n,M\^M-\ (8) 



sup 

By using this and Lemma 3.1 wc find that there is a set Xq of full measure such that 
for each M e N, each d = 2^/" and each x E Xq the following limit exists 

lim {A^k (/, g) {x) + Eak Mif, g) {x))- 

Fix now some f,g E L°°(X) with ||/||c>o, \\g\\ca < 1- Based on (8) we now get that for 
almost every x E Xq 

lim.svLpAk{f,g){x) - liminf y4fc(/,g')(x) < 



A;— >oo 



fc— >oo 



< limsup{Aak{f,g){x) + E^k^Mif, g){x)) - liminf + E^k^Mif, g){x))+ 



-1 



+ hm sup Ek (/, g){x) - hm inf Ek (/, g){x) + lOn 

k—*oo fc— >oo 

<n-^ + M-\ 

Since we can take M and n to be as large as we want, it follows that 



lim Ak{f,g){x) 

k^oo 



exits almost everywhere. 



3.3. Proof of Theorem 1.2. Since the Hilbcrt kernel involved in Theorem 1.2 is not 
integrable, the route towards proving convergence in this case poses some extra difficulties. 
As a consequence, we will present a more detailed argument in this case. 
The point is again to prove the almost everywhere convergence of the series 

N 

f{T"-x)g{T^"-x) 



along lacunary sequences and then invoke the boundedness of both / and g to get the 
convergence along the full sequence of positive integers. For simplicity we choose to 
present the argument in the particular case N — 2^. 

Let M e N be arbitrary. We apply Theorem 1.4 to a C°° kernel Km satisfying 

Km{x) = — for \x\ > 1 

X 

\Km1[-i,i]\ < 2 X li_ 1 <|^|<i. 
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Introduce the discrete kernels H^^m : R — > R, k>l, defined by the following 

1 ^ 1 

-2*:<i<2'=-l ieZ\[-2fe,2'=-l] 

Let /ci < A;2 < ■ ■ ■ < be an arbitrary sequence of positive integers. The fact that the 
terms of the sequence are positive is a crucial fact, exploited in the following. Indeed, we 
note that 



|//fc,M(?/)-Dil^,XM(?/)| <M <^ f 



\y\ > 

From the boundedness of the maximal averages in Corollary 1.7 we deduce that 

^1/2 I 



1 



k>\ ^ 

<m||sup-/ |/(x + y)5(x-y)|dy||i,oo 

<\\fU9h. 

As a consequence of this and Theorem 1.4 applied to Km^ we get that 
j-i „ 

||(^ sup \ f{x + y)g{x-y){Hk,Miy) - Hkj^^^Miy))dy\'^y^^\\i,oo 

J fCj ^ /c ^ Jij -|- 1 «/ 

<M J'/1|/||2||^?||2. (9) 

The next step consists of transferring (9) to the integers. By considering functions like 
/ : R ^ R with 

[x]) : [x] + i<a;<[a;] + i 
: otherwise 



and 51 : R — > R with 

9{x) -- 



ij{[x]) : [x] + \<x <[x] + l 
: otherwise 



we get that for each 0, : Z — > Z with finite support 
j-i 

j=l kj<k<kj+\ 

<M J^/1|0||P(Z)||^||P(Z). (10) 

For each A; > 1 introduce the kernels m : Z ^ Z and S^^m : Z — > Z defined by 

0, otherwise ' 



Sk,M{i) = 



0, otherwise 
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and note that for each k < k' 

Thus (10) gives 
J-1 

j=l kj<k<kj+i 

Standard transfer to a dynamical system X = {X, E, /i, r), as in [4], leads to 
J-1 

11(5] sup I 5]/(r"a;)5(rt)(0,,M(n) - 0,^^,,M{n))m\i,oo 

j=l kj<k<kj+-i 

<M J'"\\fh\\gh. (11) 

By invoking Lemma 3.1 it follows that if ||/||loo, Hs'IIloo < 1 

hm V/(T"x)^(T-"x)Ofe,M(n) 

k^oo '—^ 

exists for almost every a; G X. Finally 

limsup J] f{r''x)g{T-''x)Sk,Min) - liminf ^ f{T'^x)g{T-''x)Sk,M{n) < 
''^^ nez nez 



< limsup ^/(t"x)^(t ''x)Ofe,M(ri) -liminf ^/(t"x)c/(t ''x)Ok,M{n)+ 
^^"^ nez ''"^ nez 

+ limsup J2 fir''x)g{T-''x)Ak,M{n) - liminf ^ /(r"a;)y(r-''a;)A,M( 

fc— »oo ^„ fe— >oo 



neZ " neZ 

< CM-\ 

Since this holds for arbitrary M, we get that 

/(r"a:)g(r-x) 

exists almost everywhere. 

4. Discretization 

Definition 4.1. A set Q' of (not necessarily dyadic) intervals is called a grid if 

• /, /' e Q' and |/'| < |/| imply that either I' C I or I (1 1' ^ $ 

• I & Q' implies that |/| = 2^^, for some k & Z. 

The standard dyadic grid is 

S = { [Tl,2\l + 1)] : i,leZ}. (12) 

We will also be interested in a more general type of grid. For each odd integer N > 3, 
each < i < AT - 2 and < L < AT - 1, the collection 



m,t,L 



■.i = t (mod TV - 1), / e Z 
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is a grid, as it easily follows from the fact that 2^~^ = 1 (mod A^). We note that for each 
fixed N the grids QN,t,L are pairwisc disjoint, for < t < — 2 and < L < N — 1. 

For each kernel K, each f,g & L°°(R) with bounded support and each sequence 
U = {uj)j^i define the quantity 



OK,vif,9){x) 

j-i „ 

In the following we indicate how to discretize it. Choose 77 : R — > R such that rj is a. 
C°°(R\ {0}) function which equals lim^_>o+ -^(0 (O; |]) l™?^o- -^(0 on [— 1,0) and 
outside [-1, 1]. The two hmits exist due to the fact that \^K{^)\ < 1 for ^ 7^ 0. It 
suffices to prove 

||||0,,u(/,^)||l,oo< J'/1|/||2||^||2 (13) 
||||0^_,,u(/,^)||l,oo< J^/1|/||2||5||2. (14) 

The proofs for the above inequalities will follow from a more general principle, as explained 
below. The crucial property of the multiplier K — r] that will be used later is the following 

l|!^^(OI<^niin{|e|,^}, n>0. (15) 

Note that the additional inequality \K — 'r){^)\ < |^| for ^ 7^ is a consequence of the fact 
that 1^^(01 ^ 1 for ^ 0. Write 

= E ^(o?(^), (16) 

j=-oo 



where q is some Schwartz function supported in the annulus | < |^| < 2 such that 

-2i' 



Define gj = K — r]{^)q{-^). As a consequence of (15) we get that both the function 77 and 
each function Dil23 gj will satisfy 



\V{X)\ <M 



1 



(1 + N) 



M 



\2j^^^2^^^ (1 + \x\)^' 

for all M > and a; G R, uniformly in j G Z. Moreover, each of the above functions has 
the Fourier transform constant on both {0 < ^ < |} and { — | < ^ < 0}, as well as on 
{|el>2}. 
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For each j G Z define the shifted sequence U'j — IJ — j . Since the operators Og-.^jj and 
^Dii^ c/j,u' coincide, inequahties (13) and (14) will immediately follow if we prove that 

\\OeMf^9)hoo<J'^'\\fh\\9\\2, (17) 

uniformly in all C°°(R \ {0}) functions 6 which are constant on both {0 < C < and 
{ — I < < 0}, as well as on {|,^| > 2}, and which satisfy 

1^(^)1 (i+|^|)M (18) 

for all M > and xeR. 

By a similar reduction we can assume instead that 9 is constant on {\x\ < 1000} and 
on {|a;| > 4000}. This extra assumption will serve later for the purpose of creating 
disjointness of some sort between multitiles. Note that 

i>k 

where ^^(0 = 6 {2'^) -6(2'+^^) is supported in the annulus {500 x 2"* < |^| < 4000 x 2~'}. 

Pick a Schwartz function ip such that ^ is supported in [0, |] and satisfies the following 
property for every ^ G R 



E 



For each scale i use the following expansion for both / and valid in every LP norm, 
1 < p < oo 

m,leZ 

where 

i^i,mA^) = 2-m2-'x-m)e'''''-'^K 
The fundamental properties of ipi,m,i that will be used in the following are 

■ Mod_,2-«^.,^X^)l<M2(-V2-")^— — ^, M>0 (19) 



supp^,,^,iC[V\(^ + l)2-^. (20) 

The above properties can be summarized by saying that the tile [m2*, (m+l)2'] x [|2~*, (|+ 
1)2^*] is a Heisenberg box of ijji,m,i- 
With the notation 

:=2'/2y" V.^^^^(x + y)V',_^^^^(x-y)^;(y)cit/ 

it follows that 
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i=l kj<k<kj^i t=k rnXeZ'^ 

Triangle's inequality then shows that Theorem 1.8 follows once we prove that 

II EE IE E 2-''(/'^.^.^)(^'^...^)^.n./)^/li,oo 

< ^'/1l/l|2|bl|2. (21) 

The computations that follow are meant to reveal the decay and localization of the 

functions (f^^^^f. 

We first observe that since 



it turns out that 



SUpp^._^ f C SUpp^pi^rniM + ^^PP i^i,m2,l2 ^ [-2 \ (- + 1)2 (22) 

where from now on we will denote 

h = h + k- (23) 
We next observe that for each M > 



a + Ix - mi|)^(l + \x- msl)^ 
1 



(1 + |mi - m2|)^(l + \x- nn±m|)M- 
A similar estimate holds for all derivatives, and we conclude that 

1 

I— Mod_^2-99^,m,K^)l 2(-V2-n> ^ ^ m2|)^(l + |2-% - m±m|)M' 



(24) 

for each n, M > 0. 

We thus see that </?j ^ ^- satisfies the same type of properties as ■?/'i,m,i, with some extra 
uniform decay in m = \mi — m2\- In particular, if /j,^ is one of the (at most two) dyadic 
intervals of length 2* which contains HHimi^ then x [^2~*, (^ + 1)2~*] is certainly a 
Heisenberg box for ld- -, ,-. 

Finally, since 

^.^,K^) = / V^,.,ii(ei)V^,.„i.(6)^.(6-ei)e^™^«^+«^)cieici6, 
it follows that in order for </? . - to be non identically equal to we must have 

102<||-||<10^ (25) 
5 5 



12 
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The next reduction concerns the fact that there is a finite universal set e Z, such 
that every every Zi, Z2, ^3 satisfying (23) and (25) will also satisfy the following 

k^h + e (26) 

— * 

for some e e E. We will restrict the summation in (21) to those vectors I satisfying (26) 
for some fixed e. 

For each le T? and each i e Z define the cubes Q^. = ifj^A^^-', (^ + 1)2"^]. Note 
that every frequency interval [2~*-|-, 2~*(^ + 1)] of a cube Q^-. belongs to one of the grids 
G5,t.L, 0<t<3,0<L<4. This allows us to further restrict the summation in (21) to 
those / and i for which each interval [2~*^, 2~*(^ + 1)] is in a fixed grid depending on j. 

Denote by D' the union over alH e Z and all h^hih G Z which are subject to all the 
indicated restrictions, of the set of all the cubes Q^--. For each m > introduce the set of 
generalized multitiles to be 
3 

{P = \{lp,jXQii ■■ Ip,j = [mj2\ {mj+l)T], j < 2, /p,3 = 7^,^, Q^-. e D', jmi-msl = m}. 

Each such multifile can be thought of as the product of 3 tiles Pj := Ipj x (/j + l)2~*]. 

For each multifile P as above define ijjpj = i^i^rnj,^ for J = 1, 2 and ■0p,3 — '^ifh v Thus (21) 
will follow if we prove that 

IKX; sup I \ip\-^'\!.i'p^)MpMpA^f'^\V,oo 

where \lp\ is defined as being the common value of all |/pj|. 

As it will easily follow from our later analysis, it is enough to prove the above for m = 0. 
The extra decay in m for the other terms will be a consequence of the extra decay in (24). 
Once we restrict attention to the case m = 0, we remark that Ip^\ — Ip^i — Ip^^, however 
this observation will not change or simplify our argument. 

A last harmless reduction consists of sparsifying the set of time dyadic intervals. More 
precisely, we will assume that if -pj > 1 then > 2^, where A is a sufficiently large 
constant to be chosen later (A = 1000 will certainly suffice). 

We now summarize all the various reductions made so far in the following theorem, 
which implies Theorem 1.8. 

Theorem 4.2. Let Q, Qi, Q2 and he four grids with Q satisfying 

GdS (27) 

/,/' G 6? ^ max{|/||/'|-MJ'||J|-i} >2^. (28) 

Let e he a numher with 10^ < |e| < 10^ and define 

3 . . 3 
D = (f + l)2i eY[Qr.l2 = li + e, h = h + h}. 
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Define also the set of multitiles 

S = {s = Is X Qs : Is ^ Q , £ D with sidelength -rj-r}- 

\ s\ 

Assume that each multitile s — Ig x 11^=1 <^s,3 & S is associated with three functions 
(V'5j)j=i satisfying 

I— Mod_,(,^,.) xl^sA^)\ <n,M |/.r'/'-"xf , n,M>0 (29) 

SUpp iJs J C LJsJ, (30) 

for each j = 1, 2, 3. 

Then for each f,g& L^{^) o,nd each finite sequence of integers \J :— ui < U2 < ■ ■ ■ < uj 
we have the estimate 

IKE I E l^^r'^'aV'M)(^,V'.,2)V^.,3r)'/1l,oo< J'/I|/I|2||y||2, 

with the implicit constant depending only on the implicit constants in (29). 

For each 1 < j < J — 1 let : R — {uj,Uj + 1, . . . — 1} be some arbitrary 

stopping time. For each s e S with 2"^ < \Is\ < 2"-^ define 

for j e {1,2} and 

where j is the unique integer such that 2"^ < \Is\ < 2"^+^. An equivalent formulation 
for (31) that we will sometimes find easier to handle is 

IKE I E if, <l>s^){9 AsMs,3m\i,oo < J'^'WfhWgh. (32) 

7=1 ses 

2"i<|/,|<2"i+i 

From now on wc will fix the collection S, the wave packets ips.j, the sequence U and 
the stopping times Kj, and we will implicitly assume that for each s G S we have 2"^^ < 
\Is\ < 2"^. 

5. The combinatorics of the multitiles 

In this section we start by defining a relation of order between multitiles. This will alow 
us to split S into structured collections like trees and forests. The model sum restricted 
to each tree is essentially a Littlewood-Paley dyadic decomposition modulated with a 
frequency from the frequency interval of the top of the tree. Estimates for the model sum 
restricted to each such tree involves classical Calderon-Zygmund theory. This will be seen 
in Section 6. The modern time- frequency side of the whole approach manifests in the 
fact that S consists of many trees, modulated with possibly different frequencies. The 
goal of this section is to prove that the model sums corresponding to distinct trees are 
almost orthogonal, a principle quntified in various Bessel type inequalities. The almost 
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orthogonality will follow if the trees are selected to be strongly disjoint, a combinatorial 
property which, as the name suggests, is stronger than mere disjointness. 

Definition 5.1. For two multitiles s,s' &S we write 

• s'j < Sj if Is' ^ Is and Us,j ^ 0Us',j 

• s'j < Sj if s'j < Sj or Sj = s'j 

• ^'j ~ ''f ^s' ^ Is and Usj C lOeUs'j 

• s'j <' Sj if s'j < Sj and lOugj n lOa;^' j = 0, 
where e is the constant in Theorem 4-2. 

Lemma 5.2. Given any two multitiles s,s' & S such that s'^ < Si for some i e {1, 2, 3}, 
it follows that s'j <' Sj for each j e {1, 2, 3} \ {i}. 

Proof We illustrate the proof on a particular case which is otherwise representative for the 

general argument. Consider some Qs = n?=i[|2% (| + 1)2^], = lfj=Ah'^ (| + 1)2'] 
and assume that s'^ < Si. This implies that 

\l'i-2'-''li\<5. (33) 

Assume now for contradiction that u!s,2 ^ lOeuJs',2- This in turn implies that 

|;^-2'-^'/2| > lOe. (34) 

Inequalities (33) and (34) together with the fact that l2 — h = l'2~ I'l = ^ will immediately 
lead to the contradiction. 

Assume next for contradiction that 10u!s,2 H 10a;s',2 7^ 0- This in turn implies that 

\l'2-2'-''l2\< 50. (35) 

Inequalities (33) and (35) together with the fact that I2 — I1 — 1'2 — I'l ~ e will immediately 
lead to the contradiction. ■ 

Let i,j e {1,2,3}. 

Definition 5.3. A collection of multitiles T (Z S is called an i-tree with top T & S if 

Si < Ti for each seT\ {T}. 
We say that T is a tree if it is i-tree for some i. We call the tree T j-lacunary if 

Sj ^' Tj for each s e T \ {T}. 

Remark 5.4. A tree does not necessarily contain its top. By Lemma 5.2 we know that a 
tree is j-lacunary if and only if it is an i-tree for some i ^ j- Note also that each tree can 
contain at most one multitile with a given time interval. 

A very useful tool for proving estimates for a single tree T is its size, a quantity which 
encodes the BMO properties of the model sum associated with T. 

Definition 5.5. Consider some j G {1,2,3} and a finite subset of multitiles S' C S. 
Assume that each s G S' is associated with a function : R ^ C. Define the j-size of 
S' relative to the collection (Fg) by the formula 



sizej (S') = sup I ^ I {Fs, (ps,^ \ 



POINTWISE CONVERGENCE OF THE ERGODIC BILINEAR HILBERT TRANSFORM 



15 



where the supremum is taken over all the trees T e S' of lacunary type j. 

To simplify notation, we will not add (F^) as a superscript of size, since (F^) will always 
be clear from the context. Actually the 1-size and the 2-size will always be understood 
with respect to the functions / and g, respectively. 

Note that size is a monotone function with respect to S'. The following lemma shows 
how to estimate tree paraproducts by using the size. 

Lemma 5.6. // T is a tree then 

El^^r'^'IlK^-'^^.^)! ^ I^Tinsize,(T). 

seT i=l 1=1 

Proof Assume T is an i-tree. Apply I2 estimates for the terms corresponding to j, k e 
{1, 2, 3} \ {i} and Zoo for the terms corresponding to i. ■ 

Definition 5.7. Let j G {1, 2, 3}. Two trees T and T' with tops T and T' are said to be 
strongly j -disjoint if 

• Sj n s^. = for each seTU {T}, s' e T' U {T'} 

• whenever s G T U {T},s' G T' U {T'} are such that cusj C cus'j, then one has 
It n Is' = 0, and similarly with T and T' reversed. 

A collection of trees is called mutually strongly j -disjoint if each two trees in the collection 
are strongly j- disjoint and each tree is j-lacunary. 

Remark 5.8. Each subset St C T of an i-trcc T can be decomposed in a unique way as 
the disjoint union of i-trees T' containing their tops 

St= U T', 

T'eZ)(ST) 

such that these tops have pairwise disjoint time intervals. Precisely, the tops of the trees 
in the collection D{S^) will be the maximal multitiles in St with respect to the order <. 
If one has a collection JF of mutually strongly j-disjoint trees T and each T is associated 
with a subset St C T, then the decomposition 

^=U U 

T&J" T'gD(St) 

gives rise to a family of mutually strongly j -disjoint trees. 

The concept of strong disjointness is the key ingredient behind the phenomenon of 
almost orthogonality responsible for the following Bessel type inequalities. 

Proposition 5.9 (Nonmaximal Bessel's inequality, see [6]). Let j G {1,2}. Consider 
a collection S' <^ S of multitiles and let sizej(-) denote the j-size with respect to some 
function F G L^(R). Then S' can be written as a disjoint union 

s' = s; u s^, 

where 

■ ^ sizej(S') 
size,(Si) < ^ 
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while S2 consists of a family J^s'^ of pairwise disjoint trees satisfying 

^ |/t| <size,(S')-lF||^, (36) 

with the implicit constant independent of S' and F. 

For J = 3 we will need the following version of the above. We will call forest any 
collection T of strongly 3-disjoint trees and will denote by 

the counting function of 

Proposition 5.10 (Maximal Bessel's inequality). Let S' C S 6e a collection of multitiles 
and let \J := Ui < U2 < ■ ■ ■ < uj be an arbitrary sequence of integers. For each s G S' let 
j{s) de the unique number in {1,2, J — 1} such that 2"j(^) < \Is\ < 2"^('')+i. Consider 
also an arbitrary sequence of functions hi,h2, ■ ■ ■ , /ij-i : R — > C satisfying 

J-1 

and a function h G X2{E), where E is an arbitrary set of finite measure. Let size3(-) 
denote the 3-size with respect to the functions i^^ : R ^ C defined by 

Then S' can be written as a disjoint union 

s' = s; u s^, 

where 

size3(Si) < 

while S2 consists of a family !Fs'^ of pairwise disjoint trees satisfying 

E|^.|^.-..e3(S')-(-^)' (37) 

for each e > 0, with the implicit constant depending only on e. 

This proposition will follow from a chain of successive reductions, as in [4] . 

Proposition 5.11 (Maximal Bessel's inequality, first reduction). Assume S' C S can 
be organized as a forest T' of trees T with tops T. Let \J := ui < < ... < uj 
be an arbitrary sequence of integers. Consider also an arbitrary sequence of functions 
hi, h2, . . . , /ij-i : R — >■ C satisfying 

J-1 
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and a function h e X2{E), where E is an arbitrary set of finite measure. Define the 
functions Hg as before. Assume also that 

1/2 



for each T e , and 



( 



1/2 



It' — 



sex 



< 2 



m+l 



for each T' e T e JF'. 



/or eac/i e > 0, with the implicit constant depending only on e. 

Proposition 5.11 implies Proposition 5.10 by a standard tree selection algorithm, see 
[6]. 

The next reduction allows to replace the dependency on with a dependency on 

the counting function multiplicity. By linearity we can also eliminate the size parameter 
2m 

Proposition 5.12 (Maximal Bessel's inequality, second reduction). Assume S' C S can 
be organized as a forest T' of trees T with tops T. Let \J :— Ui < U2 < ■ ■ ■ < uj 
he an arbitrary sequence of integers. Consider also an arbitrary sequence of functions 
/ii, ■ ■ ■ , /ij-i : R ^ C satisfying 

J -I 

(38) 

and a function h e L^(R). Define the functions Hg as before. Assume also that 



1/2 



< 2 



(39) 



for each T e !F' , and 



\ 



1/2 



< 2 



(40) 



for each T' e T e !F' . Let Iq be an interval which contains the support of Nj^i. 
Then 



(41) 



for each e > 0, with the implicit constant depending only on e. 
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To see how Proposition 5.12 implies Proposition 5.11 we first introduce the BMO norm 
of a forest T as 

II^IIbmo := sup — ^ |7t|, 

where the supremum is taken over aU the dyadic intervals /. We then recall the following 
result from [4]. 

Lemma 5.13. Let be a forest such that for some e < 1 and some A,B > 

WN^'h < A\\Njr,\\l^ and ||j^'||bmo < ^II^V^'ll^ 
for all forests T' C T . Then we have 

Proof [of Proposition 5.11 assuming Proposition 5.12] Let C he arbitrary. Prom 
Proposition 5.12 with h replaced by and /q chosen to be so large as to contain all 

the time intervals arising from J-'', we have 



\N^'\\i = 



< Yl \{Hs/2-,<Ps,3)\' 

< jl/82-2m||^^,||^ 

2 



thanks to the L normalization of /i £ X2{E). 

On the other hand, if Iq is an arbitrary dyadic interval, then by replacing with 
{T e J^' : It Q Iq} in the above argument we see that 

-'0 ^^irr^r -'0 J 



T&T'-.ItQIo 



<J^'''\\N:p,fJl-''^\E\-^ 



thanks to the uniform bound of on / G X2{E). Taking suprema over Jq we 

conclude that ||J?-''||bmo Applying Lemma 5.13 we conclude 
that 

Y \It\ = <6 J'/'2-2"*(2-2"*|£;|-^)T^. 

Given the fact that e > is arbitrary, the proof of Proposition 5.11 follows. ■ 

We next focus on proving Proposition 5.12. We will borrow some of the terminology 
from [4] in order to quote some results from there. Let Vo,'Di,V2 be the dyadic grids 

Vo := 2'\l + 1)] : i, / G Z} (i.e. the standard dyadic grid) 

Vi := {[2~\l + (-1)73), 2-\l + 1 + (-1)73)] : / G Z} (42) 

V2 := {[2-\l - (-1)73), 2-\l + 1 - (-1)73)] : ^, i G Z}. 
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One can easily verify that for every interval J (not necessarily dyadic) there exists a 
d e {0, 1,2} and a shifted dyadic interval J' e such that J C J' C 3 J; we will say 

that J is d-regular. 

Let A > 1, and let d e {0, 1, 2}. We shall say that a collection X C I^o of time intervals 
is {A, d)-sparse if we have the following properties: 

(i) If 1,1' eX are such that |7| > then |7| > 2^^^^\r\. 

(ii) If LI' eX are such that |/| = |/'| and / ^ /', then dist(/, /') > 100A|/'|. 

(iii) If / G X, then AI is rf-regular, thus there exists an interval I a G X)^, such that 
AI Ia ^ SAL We refer to I a as the A- enlargement of /. 

If X is an {A, (i)-sparse set of time intervals and P is a tile whose time interval Ip lies 
in X, we write IpA for the A-enlargement of Ip. We recall the following two results from 
[4]. 

Theorem 5.14. Let A,D>1 and let T' he a forest with \\N'.p\\oo < X). Let also S' :— 
Ut6^' ^'^^ suppose that the time intervals 

{Is-.se S'} U {7t : T e JT'} 

are {A, d)-sparse. 

Xhen there exists an exceptional set S* C S' of multitiles with 

lU^^IS(>l"'^ + ^-'^)El^^l (43) 

such that we have the Bessel-type inequality 

E K/>^,3)r S {ilogi2 + AD)r + A''-D'')\\f\\l 

sGS'\S. 

for each v > 1 and each f e L^(R). 

Lemma 5.15 (Sparsification). LetX he a collection of time intervals. Xhen we can split 
X = Xi U . . . U Xl with L — 0{A^) such that each X; for 1 < I < L is {A, d)-sparse for 
some c? = 0, 1, 2. 

Proof [of Proposition 5.12] Wc start by noting that it suffices to prove Proposition 5.12 
without the localizing weight x^i^^- This is because Xi^^ is a polynomial and hence Xiq^'4's,z 
satisfies the same properties (29) and (30) as ^^^3. Let > 20 be arbitrary. We first apply 
the above lemma with A — C^{J\\Nj^f\\^Y/'^ to the collection X :— {L : s & S'} to split 
X into Xi, . . . ,Xi, for some L = O(A^). Each tree T G J^' will be disintegrated over the 
collections X/. A further disintegration occurs by differentiating multitiles according to 
their spacial length, so in the end 

T= U U St,,,- 

1<1<L 1<1<J-1 

Here 

St, i,j = {s G T : 7, G Xi, 2"^ < |7,| < 2"^+^} 
According to the Remark 5.8, each collection 

^i. = U U T' 

TeJF'T'eD(ST,ij) 
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is a forest. It is easy to see that each Ti^j satisfies the requirements of Theorem 5.14 with 
A as above, and moreover 

||A^^,,||oo < II A^^' II CO 

< \\Nr\\i- 

Define 

^^.i = U 

By applying Theorem 5.14 to each forest Tij with D — C^J\\N JF'\oo-i f — hhj and 
u — 200/x we get an exceptional set S/j,* such that 

I U Is\<,C;:^'"'J-'\\N:rrJ E I^tI (44) 

and we have the Bessel-type inequality 

E l(/^/^i,0.,3)|^<.(aog(2+||7V^HIoo))^V(log(2 + J))^°)||/i/i,||i (45) 

Define now 

S* := ^'j,* 

and note that if is chosen sufficiently large then 

I U ^sl < -TT^- (46) 
By summing up in (45) and invoking (38) we get 



J2 \{Hs,cl>s,3)f <M ||iV.F'||^^^((log(2 + ||iV^,||oo))'°+ (log(2 + J)) 



2 



Write n := (J^es, ^s- Then 

E \iHs,<l>s,3)\' < E \iHs,<l>s,3)\'. 

To prove (41), it thus suffices to show that 

E i(^^''^^,3)r<^EK^-'^^.3)r. (47) 

seS'-.isCn seS' 
Prom (39), it thus suffices to show that 

E \{Hs,<Ps,3)\'<l\\N:r'\\v 
seS'-.iscn 

For each tree T in consider the multifile set {s e T : C fi}. If s is any tile in this 
set with 7s maximal with respect to set inclusion, then C and from (40) we have 



E \{Hs'As',3)\'<Mls 



s'eT:I,CI, 
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Summing this over all such s upon noting that the Is are disjoint by dyadicity and maxi- 
mality, we conclude that 



Summing this over all T e we obtain 



J2 \{Hs,4>s,3)\'<MiTnn\=4 j i,,. 

r:/,cn •'^ 



l(^s,0s,3)r<4 I iV^. <4|fi|||iV^, 



and the claim (47) follows now from (46). ■ 

6. Single tree estimate 

Consider a 3-lacunary tree T and some coefficients {cs)se_T- The following representation 
holds for each 2^ < \It\, assuming A is chosen large enough 

Y ^s^sM = j :fCY Csi^sM)a'^\i - cM))e^^'^''di. (48) 

sGT:|/s|>2'= seT 

Here C, is some universal function equal to 1 on [— lOOe, lOOe] and equal to outside 
[-200e,200e]. 

Theorem 6.1 (Single tree estimate). Let T be a 3-lacunary tree in S with top T, and let 

Vt = {I dyadic : /s C / C J^/ for some s, s' G T} 

be the time convexification ofT. Consider a finite sequence of integers ui < U2 < ■ ■ ■ < ul- 
For each s & T let l{s) be the unique number m {1, 2, . . . , L — 1} such that 2"'(^) < 
< 2"'(=)+i. Consider also an arbitrary sequence of functions hi, /i2, ■ ■ ■ , : R — > C 
satisfying 

L-l 



1=1 

and a function h e X{E), for some C R of finite measure. For each s e T define 
Then 



MtI J I&Vt 1^1 Je 



with some universal implicit constant. 
Proof 

By frequency translation invariance we may assume that G ujt,3- If h is supported 
outside 2It then from the decay of ^5,3 we get 

10 



This together with crude estimates based on the triangle inequality immediately prove 
the Theorem in this case. 
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By invoking duality it hence suffices to prove for all h supported on 2It and all {agjseT 
with ||(as)lb2(T) < 1 "that 



2"!<|/sl<2"'+l 



Prom (48) we know that 



Define 



E (^s(l>s,3{x) \ = sup I E as'llJs,3{x) 
sex ui<k<ui+i 

2«i<|/,|<2"!+l 2fc<|/,|<2"i+l 

<M( E ^sM^)). 

sST 
2"I<|isl<2'''+l 



sex 

2"i<|Js|<2"*+l 



We will prove that 



L-1 



^ lev-T \i \ Je 

For a dyadic interval J denote by Ji, J2, J 3 the three dyadic intervals of the same length 
with J, sitting at the left of J, with J3 being adjacent to J. Similarly let J5, Jq, J7 be the 

three dyadic intervals of the same length with J, sitting at the right of J, with J5 being 
adjacent to J. Also define J4 = J. Let J' be the set of all dyadic intervals J with the 
following properties: 

(a) J n 2/r 7^ 

(b) $1 eVT-- \I\ < \ J\ and I dZJ 

(c) Ji e Vt for some 1 < i < 7. 

Wc claim that 2It C UjgjJ. Indeed, assume by contradiction that there exists some 
X E 2It \ UjfzjJ. Let J^^^ C J'^^^ C J*-^-* C . . . be the sequence of dyadic intervals of 
consecutive lengths containing x, with \J^^^\ = min/g-p^ \I\. Since J^'^^ ^ JT" and since (a) 
and (b) are certainly satisfied for J^^\ it follows that jj;^^ ^ Vt for each I < i < 7. 
Moreover, note that for each 1 < i < 7 there is no / e Vt with I C J-^\ We proceed 
now by induction. Assume that for some j > we proved that for each 1 < i < 7 
we have J^'^^ ^ Vt and also that there is no / G Vt with / C Note that this 

implies the same for j + 1. Indeed, since 3J^^^^^ C 7J^^^ and by induction hypothesis, 
it follows that (b) is satisfied for Hence .//^^^ ^ Vt for each I < i < 7. We 

verify now the second statement of the induction. Note that if there was an / G Vt with 
/ C J-^'^^^ than the hypothesis of the induction and the fact that 3 J^^^^^ C 7J^^^ would 
imply that i G {1, 2, 6, 7}. Hence / C J-^^^^ C It, and by convexity of Vt it would follow 
that J^~^^^ G Vt, impossible. This closes the induction. To see how the claim follows 
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from here, observe that It — J-^^ for some which certainly contradicts the fact that 
It e Vt- 

Next thing we prove is that on each interval 2 J with J ^ J ^ the oscillation of 
(^^/ |Fj|^)^/^ is well controlled. More exactly we will show that 

L-l 



sup \F,{x)-F,{y)\yl^<^. 

— x,y€2J J 2 



1=1 

Indeed, for each x,y e 2 J we use (29) and the fact that e u!t,3 to get 

L-l 

sun I PA tI\ — sun 

dx 



x,ye2J ^g^j;e2J 



/J 2 



since by definition there exists no s e T such that Is C 3J. Now 



< 1 



J 2 



and also 



l^sKIJI \Is\<\J\ ' "~iF 

1 



< 

I J|2 

due to the fact that there exists no C 3J with < | J|. 

Define now the measure space X = VJj^jJ and its sigma-algebra T generated by the 
maximal intervals J & J . Recall that 2It C Uj^jJ = X C 10/^. We will see that for 
each X & J 

(Y M{F,f{x)f/^ ^TTl l^T. M{F,f{z)f"dz + (49) 
1=1 l-^l J J 1=1 

Indeed, if > || J| then 

L— 1 ^ rx+ri 

1=1 ^^jJx-n "--^ i=i 



/ |F,|(.)d.)^)V^ < inf(J]M(F,)^(y))V^ 



< 
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On the other hand, if r; < ^\J\ then 



L-l 



2ri 



x+ri 



x-ri 



L-l 



ye2J 



1=1 

L-l 



y^J |J|2 



1=1 



Ml jj 



From (49) we can write 



sup — 



\It\~^ J 1=1 \It\-^ Jx 

= -i^ / E(/.|T)E((X]M(F,f)V2|T) + sup^ / \h\ 
|/t|2 Jx ^ JeJ Kl J J 

<-1t||E(/i|T)|U. / E((X;M(F,f)V2|T) + sup^ 

L-l 

e{{J2m{Fi)Y'\V 



Xj 



< 1 + 



1 / 2 

sup^ / Xj 



^ sup 



where E(-|T) denotes the conditional expectation relative to T. Finally, note that since 
for each J E J, Ji E Vt for some i, we have that 



sup 



\j\L 



Xj ^ sup 



JeJ \J\ Je leVT m Je 
which certainly ends the proof of our theorem. 



7. The proof of Theorem 4.2 
For each f,g e LF'^R) and each subset of tiles S' C S define 

j-i 

Ms'{f,9){x) - {Y, I E \Is\-"\fAs,i){9AsMs,^?) 

j=l sSS' 

2"j<|/s|<2''j+l 

In order to prove inequality (32) it suffices to show that 

71/4 

|{x:Ms(/,^)(x)>A}| <^, 
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uniformly for each /, g with ||/||2 = ||g'||2 = l and each A > 0. We claim that it suffices to 
prove the above for A ~ 1. Indeed, for arbitrary A choose /c e Z such that 2^^ < A < 2'^+^. 
Also, for each multifile 

^ / / 

s = [m2\ (m + 1)2^] x n[-2"\ (- + 1)2"1 e S 

define 

^ / / 

s{k) := [m2*+^ (m + 1)2^+^] x ^[[^2-'-^ + 1)2-'"'=] e S. 

j=i ^ ^ 

Define also the collection of multitiles 

S{k) = {s{k) :seS}, 

and the functions 

1 X 

Note that S(A;) and satisfy all the requirements of Theorem 4.2 (with a different 

choice of grids), with the same implicit constants in (29). The claim now follows by noting 
that 



\Isr'/'{fAs,l){9As.2)(l>sM = \Isik)\-'^Hml, /,0,,(fc),i)(Dil2,(?,0,(,),2)2'=0,(fe),3(2^,T). 

Define now T = max{[— log2(sizei(S))], [— log2(size2(S))]}, where the 1-size is under- 
stood here with respect to the function / while the 2-size is understood with respect to 
g. By iterating Proposition 5.9 simultaneously ior F — f and F — g, it follows that S 
can be written as a disjoint union S = Un>r ^n, with 

size,(|J S„) <2-" (50) 

m>n 

for J = 1,2, and each S„ consists of a family Ts„ pairwise disjoint trees satisfying 

The contributions coming from the collections S' = Ur<n<o and S" = Un>o ^^^^ ^® 
evaluated quite differently. 

In the case of S', crude estimates will suffice. Let T e J^Sn- By using (50), the decay 
in (29) and the triangle inequality we immediately get the estimate 

M^{f,g){x)<2''-x%{^) (52) 
for each x ^ 2/^. For each F < n < and each T G J-'s„ set = 2^"/^. From (51) we 
get that the exceptional set E — Ur<n<o-^" measure < 1. Also, (52) imphes that 

\\Ms'{f,g)\W(E^)< \\MsM,9)\\me^) 
r<n<o 



r<n<o 



< 1. 
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We conclude that 

\{x:Msif,g)ix)>l}\<l. (53) 

We will focus next on the estimates for the collection S". This time we will rely on the 
fact that sizei(S") < 1 and size2(S") < 1. As before, for each s e S let j{s) denote the 
unique number in {1, 2, . . . , J — 1} such that 2"j(^) < \Is\ < 2"j(^)+i. The 3-size will now 
intervene in a crucial way. Define 

y := {x:Ms>,{f,g){x)>l}. 

If \V\ < 1 there is nothing to prove, so we will assume \V\ > 1. Let /ii,/i2, ■ ■ ■ , ^j-i '■ 
R — > C be functions satisfying 

j-i 

such that 

Ms"{f,9){x) = Yl \Is\~^'^{fAs,l){9As,2)hj^s)^s,^{x). 

ses" 

From Theorem 6.1 wc know that the 3-size of the collection S" with respect to the 
functions Hg := |y|~2 ly/i^^^-j is < 1. There is actually no restriction in assuming it is < 1. 
By applying iteratively the propositions 5.9 and 5.10 to the collection S" it follows that 
S" can be written as a disjoint union S" = Un>o ^'^^^ 

size,(S:) < 2-- (54) 

for j G {1, 2, 3}, and each consists of a family jFg// of pairwise disjoint trees satisfying 

|/t|<J^/'2K (55) 

Finally, by Lemma 5.6 

\V\"^<{Ms:>{f,g)AV\-hy) 

= \is\-''^fAs,i){gAs,2){HsAs,z) 

ses" 

^ E E \h\-"\fAs,i){gAs,2){H,,<t^,,^) 

n>0 TGJP'c// 



n>0 



We conclude that 



\{x:Ms',{f,g){x)>l}\<jy\ (56) 
The estimates in (53) and (56) end the proof of Theorem 4.2. 
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8. The proof of Theorem 1.9 

By applying discretization techniques like in Section 4, Theorem 1.9 will follow from 
the following. 

Theorem 8.1. Let Q, Qi, Q2 and Qz he four grids with Q satisfying 

1,1' eg ^ max{|/||/'|-\ |/'||/|-^} > 2^. 

Let e be a number with 10^ < |e| < 10^ and define 

3 3 

D = {n[|2\ (| + 1)2^] eflgr- l2 = h + e, k = h + k}. 

j=i i=i 
Define also the set of multitiles 

S = {s = Is X Qs '■ Is ^ g , <5s G D with sidelength -r—r}- 

\ s\ 

Assume that each multitile s — Ig x Y['j=i'^s,3 & S is associated with three functions 
{ips,jfj=i satisfying 

supp ipsj C Us,j, (58) 

for each j = 1, 2, 3. 

Then for each f,g & L'^(Fi) we have the estimate 

IISUPI J2 l^sr'^'(/,^M)(^,^.,2)^.,3|')'/'||l,oo<||/||2|b||2, (59) 
\Is\=2'' 

with the implicit constant depending only on the implicit constants in (57). 

To prove this theorem amounts to proving Theorem 4.2 in the case mi,M2, • • • ,uj are 
consecutive integers, with a bound independent of J. By analyzing the whole argument 
for Theorem 4.2, it follows that the dependency on J in there comes from a single source, 
that is the Maximal Bcsscl inequality in Proposition 5.10. This dependency is eliminated 
by proving the following version of Proposition 5.11. 

Proposition 8.2. Assume S' C S can be organized as a forest T' of trees T with tops T. 
Consider an arbitrary sequence of functions /i^ : R — C, A; e Z satisfying 



■ Mod_,(,,,^.) iPsA^)\ <„,M l/.r^/'-'^xf , n, M > (57) 



feez 

and a function h e L^(R). Define the functions Hg by Hg = ^/iiog(|/s|)- Assume also that 

1/2 



for each T e . 
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2- 



Then 

Proof It suffices to prove tfiat 

E iav'.,3)r<"^"^ 



2) 



ses' 



uniformly in all /c G Z and all / e L^(R). This in turn will follow by duality from the 
following estimate 



ses' ses' 

lis 1=2* |/sl=2* 



which holds for all sequence {as). 
Indeed, 



2 

3||2 



ses' 

|/s|=2fc 



E 


II XI 0^-08,3112 


u!eg3--\oj\=2- 


-fc ses' 


E 




U!ee3:\oj\=2- 


-k s,s'eS' 


E 






-fc s.s'es' 


E 


E I"'!' 


a)663:|w|=2- 




EKi'- 





< 



seS' 
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